Amendments to the Claims:
This listing of claims replaces all prior versions and listings of claims in the application: 

Listing of Claims:

1. (Currently Amended) A method for indexing documents in a collection of documents,
each document comprising one or more index terms, the method comprising: 

determining a value x such that at least a majority of the index terms occur in x 
documents or fewer, where x is an integer;

determining a value y, where y is not equal to x, where y is an integer;

generating an inverted index for the collection of documents, the inverted index including 
an inverted list for each of the index terms, each inverted list including at least one posting and, 
if the number of postings exceeds x, further including a skip entry after the xth posting and one or
more skip entries thereafter at intervals of every yth posting;

wherein: 

a posting includes a document identifier identifying a document in the collection
of documents;

a skip entry includes a document identifier that is included in a boundary posting 
of a block of postings immediately adjacent to the skip entry in the inverted list, where a block of 
postings includes postings having document identifiers ranging from a lower to an upper value 
and where a boundary posting is a posting having a document identifier of either the lower or the 
upper value.
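
For illustration only, and not as part of the claim language, the list layout recited in claim 1 can be sketched in Python. The names `Posting`, `SkipEntry`, and `build_inverted_list` are hypothetical, and the sketch assumes one of the two options the claim allows: each skip entry carries the upper document identifier of the block immediately before it. It also assumes a skip entry is placed only where at least one posting follows.

```python
from dataclasses import dataclass
from typing import Iterable, List, Union


@dataclass(frozen=True)
class Posting:
    doc_id: int  # identifies a document in the collection


@dataclass(frozen=True)
class SkipEntry:
    # Assumption: holds the upper doc id of the block just before this entry.
    boundary_doc_id: int


def build_inverted_list(doc_ids: Iterable[int], x: int, y: int) -> List[Union[Posting, SkipEntry]]:
    """Lay out one inverted list: postings in doc-id order, with a skip entry
    after the x-th posting and after every y-th posting thereafter, but only
    when the list holds more than x postings."""
    ids = sorted(set(doc_ids))
    n = len(ids)
    # 1-based posting counts after which a skip entry is written.
    skip_after = set(range(x, n, y)) if n > x else set()
    entries: List[Union[Posting, SkipEntry]] = []
    for count, doc_id in enumerate(ids, start=1):
        entries.append(Posting(doc_id))
        if count in skip_after:
            entries.append(SkipEntry(boundary_doc_id=doc_id))
    return entries


if __name__ == "__main__":
    for entry in build_inverted_list(range(1, 11), x=4, y=2):
        print(entry)
```

Lists with x postings or fewer carry no skip entries at all in this sketch, which is the point of choosing x so that most index terms fall under it.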

2. (Currently Amended) The method of claim 1, wherein each posting further includes [[a
document identifier identifying a document in the collection of documents,]] a position identifier
identifying a position of the index term in the document, and a frequency of the index term
occurring in the document.

3. (Currently Amended) The method of claim 1, wherein the boundary posting includes a
document identifier having the lower value in the range of values and the block of [[a skip entry
identifies the smallest document number of documents included in the]] postings immediately
[[following]] follow the skip entry in the inverted list.

4. (Original) The method of claim 3, wherein the skip entry further includes information to 
locate the next skip entry in the inverted list. 
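
As a non-authoritative illustration of claims 3 and 4, the sketch below stores in each skip entry the lower document identifier of the block that follows it, plus the index of the next skip entry, and uses both to jump over blocks during a lookup. The names `Skip`, `build_list`, and `contains` are hypothetical, and the flat Python list stands in for whatever on-disk layout an implementation would actually use.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Union


@dataclass
class Skip:
    lower_doc_id: int                # smallest doc id in the block that follows
    next_skip: Optional[int] = None  # list index of the next skip entry, if any


def build_list(doc_ids: Iterable[int], x: int, y: int) -> List[Union[int, Skip]]:
    """Postings are plain ints; a Skip is written after the x-th posting and
    after every y-th posting thereafter when the list has more than x postings."""
    ids = sorted(set(doc_ids))
    n = len(ids)
    cuts = set(range(x, n, y)) if n > x else set()
    entries: List[Union[int, Skip]] = []
    skip_at: List[int] = []
    for count, doc_id in enumerate(ids, start=1):
        entries.append(doc_id)
        if count in cuts:
            skip_at.append(len(entries))
            # count < n here, so ids[count] is the first posting of the next block.
            entries.append(Skip(lower_doc_id=ids[count]))
    # Link each skip entry to the one after it.
    for here, nxt in zip(skip_at, skip_at[1:]):
        entries[here].next_skip = nxt
    return entries


def contains(entries: List[Union[int, Skip]], target: int) -> bool:
    """Look up a doc id, jumping from skip entry to skip entry where possible."""
    i = 0
    while i < len(entries):
        e = entries[i]
        if isinstance(e, Skip):
            nxt = e.next_skip
            # If the block after the next skip entry starts at or below the
            # target, the target cannot be in the block right here: jump.
            if nxt is not None and entries[nxt].lower_doc_id <= target:
                i = nxt
                continue
        elif e == target:
            return True
        elif e > target:
            return False
        i += 1
    return False
```

A lookup scans the first x postings linearly, then follows `next_skip` links until it reaches the one block that could hold the target.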

5. (Currently Amended) The method of claim 1, wherein the boundary posting includes a
document identifier having the higher value in the range of values and the block of [[a skip entry
identifies the largest document number of documents included in the]] postings immediately
[[preceding]] precede the skip entry in the inverted list.

6. (Original) The method of claim 5, wherein the skip entry further includes information to
locate the next skip entry in the inverted list. 

7. (Original) The method of claim 1, wherein y is less than x. 

8. (Original) The method of claim 1, wherein x is in the range of 256 to 512 and y is in the
range of 128 to 256. 

9. (Original) The method of claim 1, wherein the collection of one or more documents 
includes one or more binary files, data tables, source code files, text documents or combinations
thereof.



10. (Original) The method of claim 1, further comprising: 
compressing the inverted index. 
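
Claim 10 does not name a compression scheme. Purely as an assumed example, the sketch below applies a common choice for sorted document identifiers: store the gaps between consecutive identifiers and encode each gap in a variable number of bytes. The function names `encode_gaps` and `decode_gaps` are hypothetical.

```python
from typing import Iterable, List


def encode_gaps(doc_ids: Iterable[int]) -> bytes:
    """Delta-encode ascending doc ids, then variable-byte encode each gap.
    Each byte carries 7 payload bits, low-order group first; the high bit
    is set on the final byte of a gap."""
    out = bytearray()
    prev = 0
    for doc_id in doc_ids:
        gap = doc_id - prev
        prev = doc_id
        while gap >= 128:
            out.append(gap & 0x7F)
            gap >>= 7
        out.append(gap | 0x80)
    return bytes(out)


def decode_gaps(data: bytes) -> List[int]:
    """Inverse of encode_gaps: rebuild the ascending doc ids."""
    doc_ids: List[int] = []
    current, shift, prev = 0, 0, 0
    for byte in data:
        if byte & 0x80:
            # Final byte of this gap: finish it and emit the doc id.
            current |= (byte & 0x7F) << shift
            prev += current
            doc_ids.append(prev)
            current, shift = 0, 0
        else:
            current |= byte << shift
            shift += 7
    return doc_ids


if __name__ == "__main__":
    ids = [3, 7, 300, 100000]
    packed = encode_gaps(ids)
    print(len(packed), decode_gaps(packed) == ids)
```

Small gaps, which dominate in long inverted lists, take one byte each under this assumed scheme.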



11. (Original) The method of claim 1, wherein substantially all of the index terms occur in x
documents or fewer. 

12. (Original) The method of claim 11, wherein approximately 80 to 90% of the index terms
occur in x documents or fewer. 
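
The claims state what x must satisfy but not how it is computed. One possible reading, offered only as a sketch, is to take a percentile of the per-term document frequencies. The names `document_frequencies` and `choose_x` and the default of 85 percent are assumptions for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, List


def document_frequencies(docs: Iterable[Iterable[str]]) -> Dict[str, int]:
    """Count, for each index term, the number of documents it occurs in."""
    freq: Counter = Counter()
    for terms in docs:
        freq.update(set(terms))  # each document counts once per term
    return freq


def choose_x(doc_freqs: Iterable[int], percent: int = 85) -> int:
    """Return the smallest observed frequency x such that at least `percent`
    percent of the index terms occur in x documents or fewer."""
    freqs: List[int] = sorted(doc_freqs)
    if not freqs:
        raise ValueError("no index terms")
    # Number of terms that must be covered, rounded up (integer arithmetic).
    k = max(1, -(-percent * len(freqs) // 100))
    return freqs[k - 1]


if __name__ == "__main__":
    docs = [["skip", "list", "index"], ["index", "term"], ["index"]]
    df = document_frequencies(docs)
    print(df, choose_x(df.values(), percent=80))
```

Claims 8 and 15 additionally confine x to the range 256 to 512; an implementation following this sketch could clamp the computed value to that range.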

13. (Original) The method of claim 1, wherein for each inverted list, if the number of 
postings exceeds x, further including a skip entry before the first posting in the inverted list. 

14. (Original) The method of claim 1, wherein for each inverted list, if the number of 
postings exceeds x, further including a skip entry after the last posting in the inverted list. 

15. (Currently Amended) A method for indexing documents, the method comprising: 
receiving a collection of documents, each document comprising one or more index
terms;

determining a value x, wherein at least a majority of the index terms occur in x 
documents or fewer and x is an integer in the range of 256 to 512; 

determining a value y, wherein y is not equal to the value x and is an integer in the range 
of 128 to 256; and 

generating an inverted index for the collection of documents, the inverted index including 
an inverted list for each of the index terms, each inverted list including at least one posting and, 
if the number of postings exceeds x, further including a skip entry after the xth posting and one or
more skip entries thereafter at intervals of every yth posting;

wherein: 

a posting includes a document identifier identifying a document in the collection
of documents;

a skip entry includes a document identifier that is included in a boundary posting 
of a block of postings immediately adjacent to the skip entry in the inverted list, where a block of 
postings includes postings having document identifiers ranging from a lower to an upper value 
and where a boundary posting is a posting having a document identifier of either the lower or the 
upper value.

16. (Currently Amended) The method of claim 15, wherein each posting further includes [[a
document identifier identifying a document in the collection of documents,]] a position identifier
identifying a position of the index term in the document, and a frequency of the index term
occurring in the document.

17. (Currently Amended) The method of claim 15, wherein the boundary posting includes a
document identifier having the lower value in the range of values and the block of [[a skip entry
identifies the smallest document number of documents included in the]] postings immediately
[[following]] follows the skip entry in the inverted list.

18. (Original) The method of claim 17, wherein the skip entry further includes information 
to locate the next skip entry in the inverted list. 

19. (Currently Amended) The method of claim 15, wherein the boundary posting includes a
document identifier having the higher value in the range of values and the block of [[a skip entry
identifies the largest document number of documents included in the]] postings immediately
[[preceding]] precede the skip entry in the inverted list.

20. (Original) The method of claim 15, wherein the skip entry further includes information to
locate the next skip entry in the inverted list. 

21. (Original) The method of claim 15, wherein substantially all of the index terms occur in
x documents or fewer. 

22. (Original) The method of claim 21, wherein approximately 80 to 90% of the index terms 
occur in x documents or fewer. 

23. (Original) The method of claim 15, wherein for each inverted list, if the number of
postings exceeds x, further including a skip entry before the first posting in the inverted list. 

24. (Original) The method of claim 15, wherein for each inverted list, if the number of
postings exceeds x, further including a skip entry after the last posting in the inverted list. 

25. (Currently Amended) An inverted index for a collection of documents, each document
comprising one or more index terms, the inverted index comprising: 

an inverted list for each index term in the collection of documents; and 
one or more inverted lists including a quantity of postings that exceeds a value x, a skip 
entry after the xth posting, and one or more additional skip entries thereafter at intervals of every
yth posting, where the value x is such that at least a majority of the index terms occur in x
documents or fewer, and the value y is not equal to the value x and x and y are integers;
wherein: 

a posting includes a document identifier identifying a document in the collection
of documents;

a skip entry includes a document identifier that is included in a boundary posting 
of a block of postings immediately adjacent to the skip entry in the inverted list, where a block of 
postings includes postings having document identifiers ranging from a lower to an upper value 
and where a boundary posting is a posting having a document identifier of either the lower or the 
upper value.

26. (Currently Amended) The inverted index of claim 25, wherein each posting further
includes [[a document identifier identifying a document in the collection of documents,]] a position
identifier identifying a position of the index term in the document, and a frequency of the index
term occurring in the document.

27. (Currently Amended) The inverted index of claim 25, wherein the boundary posting
includes a document identifier having the lower value in the range of values and the block of [[a
skip entry identifies the smallest document number of documents included in the]] postings
immediately [[following]] follow the skip entry in the inverted list.



28. (Original) The inverted index of claim 27, wherein the skip entry further includes 
information to locate the next skip entry in the inverted list. 

29. (Currently Amended) The inverted index of claim 25, wherein the boundary posting
includes a document identifier having the higher value in the range of values and the block of [[a
skip entry identifies the largest document number of documents included in the]] postings
immediately [[preceding]] precede the skip entry in the inverted list.

30. (Original) The inverted index of claim 29, wherein the skip entry further includes 
information to locate the next skip entry in the inverted list. 

31. (Original) The inverted index of claim 25, wherein x is in the range of 256 to 512 and y
is in the range of 128 to 256. 

32. (Original) The inverted index of claim 25, wherein substantially all of the index terms 
occur in x documents or fewer. 

33. (Original) The inverted index of claim 32, wherein approximately 80 to 90% of the index 
terms occur in x documents or fewer. 

34. (Original) The inverted index of claim 25, wherein the collection of one or more 
documents includes one or more binary files, data tables, source code files, text documents or 
combinations thereof. 

35. (Original) The inverted index of claim 25, wherein the one or more inverted lists further 
include a skip entry before the first posting in the inverted list. 

36. (Original) The inverted index of claim 25, wherein the one or more inverted lists further 
include a skip entry after the last posting in the inverted list. 

37. (Currently Amended) An article comprising a machine-readable medium storing 
instructions operable to cause one or more machines to perform operations comprising: 

determining a value x such that at least a majority of the index terms occur in x 
documents or fewer, where x is an integer;

determining a value y, where y is not equal to x and where y is an integer;

generating an inverted index for the collection of documents, the inverted index including
an inverted list for each of the index terms, each inverted list including at least one posting and,
if the number of postings exceeds x, further including a skip entry after the xth posting and one or
more skip entries thereafter at intervals of every yth posting;

wherein: 

a posting includes a document identifier identifying a document in the collection
of documents;

a skip entry includes a document identifier that is included in a boundary posting 
of a block of postings immediately adjacent to the skip entry in the inverted list, where a block of 
postings includes postings having document identifiers ranging from a lower to an upper value 
and where a boundary posting is a posting having a document identifier of either the lower or the 
upper value.

38. (Currently Amended) The article of claim 37, wherein each posting further includes [[a
document identifier identifying a document in the collection of documents,]] a position identifier
identifying a position of the index term in the document, and a frequency of the index term
occurring in the document.

39. (Currently Amended) The article of claim 37, wherein the boundary posting includes a
document identifier having the lower value in the range of values and the block of [[a skip entry
identifies the smallest document number of documents included in the]] postings immediately
[[following]] follow the skip entry in the inverted list.

40. (Original) The article of claim 39, wherein the skip entry further includes information to 
locate the next skip entry in the inverted list. 

41. (Currently Amended) The article of claim 37, wherein the boundary posting includes a
document identifier having the higher value in the range of values and the block of [[a skip entry
identifies the largest document number of documents included in the]] postings immediately
[[preceding]] precede the skip entry in the inverted list.

42. (Original) The article of claim 41, wherein the skip entry further includes information to
locate the next skip entry in the inverted list. 

43. (Original) The article of claim 37, wherein y is less than x. 

44. (Original) The article of claim 37, wherein x is in the range of 256 to 512 and y is in the
range of 128 to 256. 

45. (Original) The article of claim 37, wherein the collection of one or more documents 
includes one or more binary files, data tables, source code files, text documents or combinations 
thereof. 

46. (Original) The article of claim 37, further comprising instructions operable to cause one 
or more machines to perform operations comprising: 

compressing the inverted index. 

47. (Original) The article of claim 37, wherein substantially all of the index terms occur in x 
documents or fewer. 

48. (Original) The article of claim 47, wherein approximately 80 to 90% of the index terms 
occur in x documents or fewer. 

49. (Original) The article of claim 37, wherein for each inverted list, if the number of 
postings exceeds x, further including a skip entry before the first posting in the inverted list. 

50. (Original) The article of claim 37, wherein for each inverted list, if the number of 
postings exceeds x, further including a skip entry after the last posting in the inverted list. 

51. (Currently Amended) An article comprising a machine-readable medium storing
instructions operable to cause one or more machines to perform operations comprising: 

receiving a collection of documents, each document comprising one or more index
terms;

determining a value x, wherein at least a majority of the index terms occur in x
documents or fewer and x is an integer in the range of 256 to 512;

determining a value y, wherein y is not equal to the value of x and is an integer in the range
of 128 to 256; 

generating an inverted index for the collection of documents, the inverted index including 
an inverted list for each of the index terms, each inverted list including at least one posting and, 
if the number of postings exceeds x, further including a skip entry after the xth posting and one or
more skip entries thereafter at intervals of every yth posting;

wherein: 

a posting includes a document identifier identifying a document in the collection
of documents;

a skip entry includes a document identifier that is included in a boundary posting 
of a block of postings immediately adjacent to the skip entry in the inverted list, where a block of 
postings includes postings having document identifiers ranging from a lower to an upper value 
and where a boundary posting is a posting having a document identifier of either the lower or the 
upper value.

52. (Currently Amended) The article of claim 51, wherein each posting further includes [[a
document identifier identifying a document in the collection of documents,]] a position identifier
identifying a position of the index term in the document, and a frequency of the index term
occurring in the document.

53. (Currently Amended) The article of claim 51, wherein the boundary posting includes a
document identifier having the lower value in the range of values and the block of [[a skip entry
identifies the smallest document number of documents included in the]] postings immediately
[[following]] follow the skip entry in the inverted list.

54. (Original) The article of claim 53, wherein the skip entry further includes information to 
locate the next skip entry in the inverted list. 

55. (Currently Amended) The article of claim 51, wherein the boundary posting includes a
document identifier having the higher value in the range of values and the block of [[a skip entry
identifies the largest document number of documents included in the]] postings immediately
[[preceding]] precede the skip entry in the inverted list.

56. (Original) The article of claim 51, wherein the skip entry further includes information to 
locate the next skip entry in the inverted list. 

57. (Original) The article of claim 51, wherein substantially all of the index terms occur in x 
documents or fewer. 

58. (Original) The article of claim 57, wherein approximately 80 to 90% of the index terms 
occur in x documents or fewer. 

59. (Original) The article of claim 51, wherein for each inverted list, if the number of
postings exceeds x, further including a skip entry before the first posting in the inverted list. 

60. (Original) The article of claim 51, wherein for each inverted list, if the number of
postings exceeds x, further including a skip entry after the last posting in the inverted list. 



